INVENTION TITLE 



System and Method to distribute reasoning and pattern matching in forward and backward 

chaining rule engines 
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DESCRIPTION 

Background of the Invention 

[Para 1 ] The RETE algorithm was invented bv Dr. Charles Forgv in 1978 and re-published in 
1 982 titled "Rete: A Fast Algorithm for the Many Pattern/Many Object Pattern Match 
Problem", in Artificial Intelligence. 1 9 (1 982) pp. 1 7-37. RETE was originally designed to 
improve pattern matching performance. The algorithm can be divided into two parts: the 
rule compiler and the runtime. The rule compiler analyzes a set of rules and creates a 
rooted acyclic graph. The conditions of the rule are converted to one and two input nodes. 
The action of the rule is converted to a specialized 1 -input node called terminal node. The 
second part of the algorithm is the runtime, which consists of a working memory and an 
agenda. Data reouiring evaluation is added to the working memory and traverses the acyclic 
graph generated bv the rule compiler. When all conditions of the rule are satisfied, the data 
triggering the rule are added to the agenda. RETE algorithm uses match-resolve-act cycles 
to evaluate the data. When all the rules have been fired, the evaluation process is complete. 
Each one input node in the acyclic graph remembers which data satisfies the condition in an 
alpha memory. Each two input node contains an alpha memory for the right side and a beta 
memory for the left side. The two input node uses the memory to remember which data 
matches the other side. Rule engine technology i s the product of research i n Artific i a l 
Inte lli gence and intel li gent systems. Throughout the 1970's, 80's and 90's, researchers 
hav e so l v e d some complex computing chal le ng e s. On e of th e most effic i ent and w e l l tested 
a l gor i thms is RETE. I t was or i g i na ll y descr i bed by Charles Forgy. 

[Para 2] RETE uses the one input nodes to evaluate literal constraints, and two input nodes 

to evaluate joins between two data elements. An example of a literal constraint might be 

"the state is MA". An example of a join might be "the zip code of the billing address is egual 

to the zip code of the mailing address." The rule compiler analyzes the rules for similarities 
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and organizes the nodes to optimize performance and scalability. Forward chaining rule 
engines are said to be data driven. The rule engine can onlv reason over data that is added 
to the working memory. 
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Summary of the Invention 

[Para 3] The invention solves performance issues previously ignored bv researchers in the 
field of artificial intelligence. Using existing techniques, a rule engine implementing Dr. 
Forgv's original RETE algorithm cannot reason over extremely large datasets or partitioned 
data. The invention solves the following limitations. 

o Dividing the pattern matching between multiple rule engines and share the 

partial and complete matches efficiently. 
o Efficiently dividing the pattern matching across several rule engines 
dynamically. 

o Efficiently distributing the indexes of the working memory across multiple 
machines. 

o Efficiently distributing one or more conditional elements of a rule to remote 
systems and dividing the work across a cluster of computer systems. 
[Para 4] In general, one form of the invention is a computer system which implements the 
RETE extension to perform distributed reasoning. The system distributes reasoning bv 
replicating the memory indexes across the systems. This reduces the memory requirements 
of each system and reduces the quantity of data each rule engine sends to the others. Using 
this approach, each rule engine in the cluster has a subset of the entire dataset. A system 
implementing the distributed RETE extension can be used in a variety of applications like 
order management, regulatory compliance, military command control applications or 
business process automation. The invention is particularly well suited to large real-time 
systems like military command control systems or security trading systems, which partition 
data across several locations. 
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Detailed Description of the preferred embodiment 

[Para 5] RETE defines a method for converting if /then /else statements to executable code. 
The if /then /else statements represent business rules that define business decisions. For 
example, an user mav define a rule for managing orders "If the items of the order are in 
stock and the credit card is valid, then send the order to fulfillment". In the example, "if the 
items of the order are in stock and the credit card is valid" are the conditions of the rule. 
The action of the rule is "send the order to fulfillment." The rule engine translates the 
conditions of the rule to a graph also known as a directed acvclic graph. During the 
translation process, it identifies similarities between rules and optimizes the graph to 
improve performance. In the situation were multiple rules share the same condition, the rule 
engine will determine if the nodes in the graph can be shared between rules. This reduces 
the number of evaluations a rule engine has to perform on the data. An example RETE graph 
is provided in Figure 1 . RETE is capable of evaluating tens of thousands or hundreds of 
thousands of rule efficiently and only evaluates the rules that apply to the data. Prior to the 
invention of RETE. many systems had to evaluate all the rules for every piece of data. This 
means as the rule count or dataset grows, it takes longer to evaluate. A system using RETE 
is able to provide better performance than naive systems that evaluate all rules for every 
data element. 

[Para 6] RETE has several limitations, which affect the performance and scalability. When 

the dataset is over 10 million records, the system mav not have enough physical memory. In 

those situations, current systems cannot compute the result and would like fail 

catastrophicallv. Products on the market today cannot reason over extremely laroe datasets 

over 50 million records with thousands of complex business rules. 

[Para 7] Distributed reasoning, unlike collaborative agents provides methods by which 

mu l t i ple rule eng i nes reason over l arge datasets i n rea l t i me. Whereas co ll aborat i ve agents 
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use agents to reason over d i screte prob l ems, i t cannot reason over largo sots of data. 

Furthermore, col l aborative agents" techn i ques requ i re the ru l e engine to l oad all necessary 

data w i th i n the same eng i ne. For sma l l datasots, those techn i ques prove to be powerfu l , but 

thoy do not scalo for l arge datasots. The invention solves the memory limitation bv dividing 

the data into partitions. Instead of having the entire dataset in a single rule engine, the data 

is divided across dozens or thousands of rule engines across many computer systems. 

There are two primary benefits of partitioning the data. First, pattern matching is performed 

in parallel bv multiple rule engines. Second, a rule engine can reason over data in another 

engine thousands of miles awav in another location. This is achieved bv replicating the 2- 

inout node memory indexes across systems sharing the same rules or nodes. When the 

engine evaluates a fact, it uses the index to perform the comparison. Any changes to the 

indexes are sent to the other systems periodically. This enables businesses to build a large 

cluster of cheap servers to reason over hundreds of millions of records in real-time. 

[Para 8] Technically, a forward chaining rule engine utilizing distributed reasoning and 

pattern matching RETE a l gor i thm , can reason over large datasets when nodes are 

distributed to other systems. This allows the system to match on a specific i nstance of a 

class (a l so ca l led bus i ness object) data element without requiring the object instance reside 

in the same working memory and be locally available. It is important to note the engine will 

reason over shared data resident in other engines using data steams or indexes . This 

reduces the memory requirements and improves efficiency. It also enables the engines to 

share all or part of their reasoning network and distribute the process. Distributing nodes ©f 

a set of ru l os across multiple systems allows each engine to perform pattern matching on a 

single ob j ect i nstance data element and route the results to the originating system. Unlike 

load balancing techniques, a true distributed reasoning system does not require all systems 

of a cluster to deploy identical sets of rules, which creates redundant rules within the 

environment and increases maintenance costs. In a distributed reasoning/distributed 
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pattern matching system, each system deploys a different set of rules (also called module or 
rule set), based on its own needs and configuration requirements. At runtime, the rule 
engines monitor resource utilization and distribute nodes dynamically and on demand, in 
some cases, it mav be desirable to deploy the same set of rules across multiple instances. 
At runtime, the cluster would intelligently partition the data. 

[Para 9] Prior techniques research with pattern matching on extremely large datasets relied 
on load balancing techniques to prioritize and categorize atomic processes. This approach 
requires every system te have all required data locally, leading to multiple data caches. In a 
production environment with large datasets, each system cannot load all required data and 
data synchronization becomes a potential problem. 

[Para 1 0] The invention described in the patent was previously impractical due to the 
processing power of personal computers. Researchers in the field of Artificial intelligence 
did not consider data partitions with replicated indexes. The invention is currently feasible 
due to advances in the processing power of personal computers and high-speed networks. 
With this invention, it is possible to build large distributed reasoning systems using 
hundreds and thousands of personal computers. Bv distributing the indexes and facts 
between thousands of systems effectively, a forward and backward chaining rule engine can 
reason over hundreds of millions of facts using inexpensive hardware. Without this 
invention, rule engines cannot reason over extremely large datasets. 
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Description of the Drawings 

[Para 1 1 ] Fia. 1 is a network diagram of an example rule with four one input, one two input 
and one terminal node. The example rule contains one literal constraint for each object type 
and one join node between the two data elements. 

[Para 1 2] Fig. 2 is a block diagram illustrating data partitioned across several rule engines. 
Each engine has the same rules, but may also contain other rules, which are not shared bv 
other engines. A set of data is divided across several engines to improve scalabiltv and 
performance, enabling parallel processing. 

[Para 1 3] Fig. 3 is a block diagram, which shows 2 rule engines starting with different set of 
rules at startup. Some time later at T n some nodes from one rule engine are distributed to 
the other rule engine. Optionally, both system can start with the same set of rules. In both 
cases, the data is partitioned as Fig. 2 illustrates. 
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